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This application note describes a “no hassles” interface between the 
IBM PC-style backplane and a TMS320C30 DSP chip via an IDT dual- 
port static RAM. The interface provides an extremely simple means of 
downloading cross-compiled DSP code as well as sample data sets for 
debugging a high speed TMS320 based system in real time. 

This example also shows how easily interprocessor communications 
hardwarecanbeimplemented via the simple insertion ofadual-portSRAM 
between a DSP chip anda general purpose processor in astandard DSP 
system. Asystemlike this one wouldtypically use astandard CPU for data 
input/output and ordering, andwould pass complete data sets tothe DSP 
chip forintense calculation. Similar architectures are often usedin graphics 
andimage processing, where an entire image is manipulated as a single 
data set, in transform calculations (i.e. FFTs) for sonar and radar 
processing. Certain systems even use this scheme several times with 
numerous DSP chips in order to get processing speeds proportional to the 
number of DSP chips inthe system. 


System Objective 

The design presented hereis the TMS320C30 Software Development 
Board. This boardis one portion of asystem which helps the TMS320C30 
programmer to download and debug code from an IBM PC or similar 
computer. In order to support the special hardware needs of the 
TMS320C30 programmer, an expansion connector allows memory tobe 
added to the DSP chip's primary bus, while a target connector provides 
a fully buffered version of the chip’s expansion bus to allowits connection 
tospecial purpose hardware. Mostofthe TMS320C30’s status signals are 
also routed to the expansion bus to make them available to the hardware 
being debugged. 

The majority ofthe control software is PC-resident, andis providedon 
magnetic media. Thisincludes such tools as the assembler, compilers, and 
download and debug routines. A 2K x 32 EPROM array on the primary 
bus ofthe TMS320 provides the host processor with a set of commands 
toallowittoloadthe software development board's RAMs, to setandclear 
breakpoints, to examine and preset internal status, and to load or store 
values in individual memory locations. All of these are controlled by the 
host’s sending acommandto the TMS320, which interprets thatcommand 
and takes appropriate action. 

Ahighspeed 16K x 32 static RAMis attached to each of the DSP chip’s 
two buses: the expansion bus, andthe primary bus. The expansion bus’ 
SRAM would typically be used to store a data set to be operated upon, 


and the primary bus’ RAM would be used to store code which would be 
debugged .usingthis board. Both ofthese SRAMs are zero wait-state (25ns 
access times at 33MHz) to allow real-time debugging andbenchmarks to 
be performed. Since the TMS320's expansion bus only supports address- 
ing ofupto 8K locations, abankselectsignalis used to switch between the 
upper and lower halves of this port’s 16K x 32 memory. This signal is 
software-controlled from the processor's expansion bus. 

One design goal for this system was to move data into and out of the 
DSP’s dedicated memory without taking an inordinate amount of time or 
hardware. If standard memory were to be shared between the host and 
the DSP chip, multiplexing logic would need to be inserted between each 
processor andthe RAM’s address, data, andcontrollines. This logic would 
finditselfrightin the critical timing path of the memories onthe primary and 
expansion buses, and would make zero wait-state operation nearly 
unachievable. An additional headache would have been finding room on 
the board forthe large amount of multiplexing logic required. Should the 
design have used a simplermethod of passing databack and forth between 
processors either via a UART or a single byte-wide I/O buffer, the 
developer would have had to endured long delays during download and 
other communication functions as the software on either side ofthe port 
performed massive amounts of handshaking to pass even the smallest of 
datasets. 

Itbecame obvious early in the design cycle that the simplest method 
of performing fasthostto DSP communication wouldbe to use alarge high- 
speed true dual-port static RAM to performinterprocessor communica- 
tions. A dual-port RAM would allow both the host and the DSP chip to 
transfer data in packets, rather than as individual bits or bytes, thus 
accelerating downloading. The selected dual-port device would have to 
be one which providedsome means of signalling that data packets were 
ready to be handed back and forth between processors. 

An|DT71342was chosen because ofits speed, its depth (4K bytes), 
the simplicity of its interface, and its ability to perform interprocessor 
communications through its eightinternal semaphore flags (see Appendix: 
“Dual-Port Semaphores’). By using an IDT71342, the designers could 
use asingle chip toimplement 4K byte high speed blocktransfers between 
the hostandthe TMS320, andto signalthe completion of atransfer without 
additional hardware. Although the 45ns access time dual-portusedin this 
system does not support zero-wait data transfers at maximum CPU 
speeds, datatransfers are notin the critical path of the sort of software this 
system is used to debug. A true zero wait-state system could have been 
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Figure 1. TMS320C30 Software Development Board Block Diagram 


realized had the designers used a 25ns dual-port. 
Figure 1 shows a block diagram of the complete system. The full 
schematic ofthe system is shownin Figure 6. 


Interfacing to the Dual-Port SRAM 

The IDT71342 dual-port RAM uses aninterface whichis similarto any 
standard single-port byte wide static RAM. Each of the two ports (leftand 
right) uses a separate setof control, address, and |/O pins. Address inputs 
arenotmultiplexed with data |/O. The controlinterface consists of three pins 
on either side: read/write (R/W), output enable (OE), and chip enable 
(CE). The R/W and OE pins also operate in conjunction with the 
semaphore select pin (SEM), which imitates the functionality of the chip 
enable pin, but rather than allowing reads and writes ofthe memory array, 
this pin routes the read and write control to the eight on-chip semaphore 
flags. 

Write cycles are controlled by the simultaneous application of alogic 
LOWonboththe CE and R/Winputs for one side of the SRAM, andeither 
signal canbe used to control the timing of awrite cycle. Ifthe CE signalis 
held low and the timing is set by a LOW pulse on the R/W pin, itis called 
a “RW controlled write cycle” (figure 2). Write cycles where R/W stays 
low while CE is pulsed low are called “CE controlled write cycles” 
(figure 3). By offering both methods of communication, IDT’s dual-port 
SRAMs canbe easily connected between systems with greatly differing 
bus interface specifications. An interesting point about this design is 
thatwhile the PC orhostside of the dual-port uses a RW controlled write 
cycle, the DSP writes toits side of the dual-port by using aCE controlled 
write cycle. 


The PC Bus Interface 

Inthis design, the PC bus control signals are routed nearly directly from 
the backplane to the IDT71342’s R/W and OE pins. The signal functions 
and timing of the backplane are an ideal match with those of the dual-port 
RAM. However, adecisionwas made to map the memory array intoa4K 
space in the PC’s memory space, while the semaphores were to be 
mappedinto the PC’s I/O space, which forced the Ow and MEMw signals 
tobe ORedbefore driving them into the IDT71342’s RW input. Likewise 
IORandMEMR signals are ORed before driving theminto the IDT71342's 
OE input. 

The dual-port’s chip enable (CE) pinis driven indirectly by an address 
decoder consisting ofan eightbit comparator 74ALS521 which compares 
the output of a74LS377 register with addresses A12-A19. The 74LS377 
isan l/O mapped register that allows the dual-port SRAM to be mapped 
into any 4K-byte regioninthe PC’s main memory space. A PAL resident 
control register bit on the board allows the dual-portmemory tobe disabled, 
whichis its state at power-up or reset. 

The semaphore enable pin (SEM) is driven by a 20L8 PAL which 
decodes addresses fromthe PC Bus. This decoder determines whether 
the hostis accessing memory or I/O space via the MEMR, MEMw, |Or, 
andlOwsignals, and enables the semaphores during an|/O access ifthe 
proper address (Ao-Ag) is applied to the inputs of the PAL. The PAL also 
uses the IOw and MEMwsignals to generate a R/Wcontrolledwrite cycle, 
while using decoded addresses to drive the CE and SEM inputs. 
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Figure 2. Timing Waveform of Write Cycle No. 1, R/W Controlled Timing 
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NOTES: 

RW or CE must be HIGH during all address transitions. 

A write occurs during the overlap (tew or twe) of either CE or SEM = Vit and RW = Vit. 

twr is measured from the earlier of CE or RAW going HIGH to the end-of-write cycle. 

During this period, the I/O pins are in the output state, and input signals must not be applied. 

lf the CE LOW transition occurs simultaneously with or after the R/W LOW transition, the outputs remain in the High-impedance state. 

Timing depends on which enable signal (CE or R/W) is asserted last. 

This parameter is guaranteed by device characterization, but is not production tested. Transition is measured +500mV from steady state with the Output Test Load 

(Figure 2). 

8. If OE is LOW during a RWW controlled write cycle, the write pulse width must be the larger of twp or (twz + tow) to allow the I/O drivers to turn off data to be placed 
on the bus for the required tow. If OE is HIGH during an R/W controlled write cycle, this requirement does not apply and the write pulse can be as short as the 
specified twp. 

9. To access SRAM, CE =Vit and SEM = Vin. To access semaphore, CE = Vin and SEM = ViL. Either condition must be valid for the entire tew time. 
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Figure 3. Timing Waveform of Write Cycle No. 2, CE Controlled timing!" 
NOTES: __ 
1. RW or CE must be HIGH during all address transitions, = 
2. A-write occurs during the overlap (tew or twp) of either CE or SEM = Vit and R/W = VIL. 
3. {wr is measured from the earlier of CE or R/W going HIGH to the end-of-write cycle. 
4. If the CE LOW transition occurs simultaneously with or after the R/W LOW transition, the outputs remain in the High-impedance state. 
5. Timing depends on which enable signal (CE or R/W) is asserted last. — 
6. To access SRAM, CE =Vit and SEM = ViH. To access semaphore, CE = VIH and SEM = VIL. Either condition must be valid for the entire tew time. 


All data and address pins of the IDT71342 are isolated from the 
backplane with TTL buffers. A detail of the PC to dual-port interface is 
shown in Figure 4. 

The reader should note that several considerations increased the 
complexity of this interface. If this design had involved a dedicated host 
processor rather than ageneral purpose PC, the need for buffering would 
probably have been drastically reduced. Hadboththe 4K byte SRAM and 
the semaphores been mapped into the memory space of the host, no ORing 


would have been required onthe MEMw, MEMr, Ow, and!Or signals. 
Finally, avery complex address decoderwasimple-mentedin this system 
to allow the IDT71342’s RAM to be mapped anywhere within the PC’s 
memory space. By using amore straightforward fixed-address scheme, 
logic complexity could be significantly reduced. Itis conceivable that the 
entire interface including address decoding could have been handled with 
asingle IC. 
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Figure 4. PC Bus to IDT71342 Dual-Port Interface (Left-Hand Side of Dual-Port). 
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Figure 5. TMS320C30 to IDT71342 Dual-Port Interface (Right-Hand Side of Dual-Port). 
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The TMS320C30 Interface 

The TMS320 interfaces to the dual-port SRAM through the I/O strobe 
onthe expansion bus. The same bus is used to interface to a 16K x 32 
static RAM via its memory strobe signal. These two strobes signify two 
different ranges on the DSP chip’s internal address map. A detailed 
diagram of the TMS320 to IDT71342 interface is shown in Figure 5. 

Asinthe PC businterface, the address lines are buffered between the 
processor andthe dual-port SRAM, however the light loading onthe data 
bus removes the need for data buffering on this side. The only devices 
connected to the datapins are: the dual-port SRAM, the DSP chip, a static 
RAM, astatus latch, and a transceiver. The address bus needed buffering 
since all eight 16K x 4 RAM chips, as well as the dual-port, a PAL, and 
anaddress buffer are attached to these pins. 

The TMS320’s expansion bus uses a strobe to activate an I/O cycle, 
and alevelto distinguish read cycles from write cycles. In this design, the 
expansion read/write (XR/W) output of the TMS320is connected directly 
tothe IDT71342 dual-portto drive the read/write (R/W) input, andis simply 
inverted to drive the output enable (OE) input. This inverter is not truly 
necessary, since the dual-port places its data outputs into a high- 
impedence state automatically upon the application of awrite (LOW) level 
onthe R/Winput. The OE pin on this side could have been permanently 
tied active (grounded). 

A20L8 PALis used to control the chip enable (CE) input for this side 
ofthe dual-port SRAM. This signal is adecoding ofthe DSP’s expansion 
bus address bits XAo-XA12. The PAL used in this interface had too few 
product terms to allow the combination of the I/O strobe with the decoded 
address, so the buffered I/O strobe (BIOSTRB) has been externally 
ANDedwith the decoded address output from the PAL before being fed 
into the dual-port. The semaphore selectis handled the same way, buta 
different address decoding is used from the same PAL, andthe |/O strobe 
is ANDed through a different gate into the semaphore (SEM) inputof the 
dual-port. Both of these signals can be disabled by writing to the control 
register. 

The TMS320C30 writes to the dual-port SRAM by implementing a CE 
controlled write cycle. The CE and SEM inputs are driven by two-input 
OR gates. One of the inputs of each of these gates represents a decoded 
address output from a 20L8 PAL, while the second input is driven by a 
buffered version of the I/O strobe. The only other qualifying input is the 
read/write (R/W) input, which is directly driven by the expansion read/ 
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write (XR/W) signal onthe TMS320. When the DSP chip writes to the dual- 
port, the address and read/write signals are output first, followed by the 
/O strobe. Since IOSTRBis used to gate the CE or SEM signal, the timing 
meets the criteria fora CE controlled write cycle. 

The expansion ready (XRDY) input to the TMS320, which tells itthat 
the expansion bus cycle is complete, is a combination of the decoded 
address range from the PAL and a clock delay from the TMS320’s H1 
(clock/2) output. This signal is required for systems using slower dual-port 
RAMs, butis notnecessary in systems where faster dual-ports are used. 
If the system designer choses a 25ns or faster part for use ina 33MHz 
TMS320C30 system, the XRDY input canbe generated immediately upon 
accessing the dual-port RAM. 

The gating used here generates a single wait-state on any I/O strobe 
within the address range of the IDT71342. This logic could be removed 
ifafaster IDT71342 were used. On anlOSTRB output from the TMS320, 
if the PAL decodes a dual-ported address, the strobe and decoded 
address are combined in the second ofthe two AND gates in Fig. 5. This 
AND gate’s output is fed into the XRDY OR gate to extend 
the expansion bus cycle. On the next rising edge of H1, the IOSTRB is 
clockedinto the flip flop. This flip flop’s outputis connected tothe first AND 
gate anddisables the IOSTRB from reaching the second AND gate. This 
inturn allows the XRDY inputto the TMS320 to go active, and allows the 
cycle to end. A single wait-state more than compensates for the 45ns 
address access time ofthe dual-portusedinthis application. Other signals 
called target I/O ready (TIORDY) from the target connector, and the 
MSTRB signal from the DSP chipitself can also signal an expansion bus 
ready state. Since the MSTRB signalis used only to control accesses to 
the expansion bus’ 16K x 32 zero wait-state RAM, itis ORed directly back 
to the XRDY input through the 74AS11 gate as shown. 


Conclusion 

The TMS320C30 Software Development Board shows the simplicity 
of designing an interface between a TMS320 DSP chip and the IBM PC 
bus using an IDT71342 dual-port RAM. The dual-port RAM serves to 
reduce component count, increase interprocessor communications 
throughput, and simplify design. Designers shouldbe able to follow the 
example given here to profitably use dual-port SRAMs to handle 
communications in any similar dual processor system. 
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Appendix 


Dual-Port Semaphores 

Eight extra address locations in the IDT71342 4K x 8 dual-port RAM 
are dedicated to binary semaphore flags. These flags allow either the 
TMS320 orthe hostprocessor to claim aprivilege over the other processor 
forfunctions defined by the programmer's software. As an example, the 
semaphore can be used by the PC to inhibit the TMS320C30 from 
accessing a portion of the dual-port SRAM, or some other shared 
resource. 

The dual-port SRAM features a fast access time, and both ports are 
completely independent of each other. This means thatthe activity onthe 
left portin no way slows the access time of the right port. Both ports are 
identicalinfunctionto standard static RAMs andcanbe read from, orwritten 
to, at the same time with the only possible conflict arising from the 
simultaneous writing of, or a simultaneous READ/WRITE of, a non- 
semaphore location. Semaphores are protected against such ambiguous 
situations and may be used by the system program to avoid any conflicts 
inthe non-semaphore portion of the dual-port SRAM. 

Multiple processor sytems like the TMS320C30 Software Develop- 
ment Board can benefit from a performance increase by using these 
semaphores, which provide a lockout mechanism without requiring 
complex programming. 

Software handshaking between processors offers the maximum in 
system flexibility by permitting shared resources to be allocatedin varying 
configurations. The IDT71342 does not use its semaphore flags to control 
any resources through hardware, thus allowing the programmer to 
determine each flag’s meaning. 


How the Semaphore Flags Work 

The semaphore logicis a set of eight latches which are independent 
ofthe dual-port SRAM. These latches canbe used to pass a flag, ortoken, 
from one processor tothe other to indicate that a shared resourceisinuse. 
The semaphores provide a hardware assist for ause assignment called 
“Token Passing Allocation.” Inthis method, the state ofasemaphore latch 
isusedas atoken indicating thata shared resourceis in use. Ifthe TMS320 
wants to use this resource, itrequests the token by writing azerointo the 
latch. The TMS320 then verifies its success in writing the latch by reading 
it. If it was successful, it proceeds to assume control over the shared 
resource. Ifitwas notsuccessfulin writing azero into the latch, itdetermines 
thatthe PC had setthe latch first, isin posession ofthe token, andis using 


the shared resource. The TMS320 can then either repeatedly inquire the 
status of the semaphore it requested, oritcan remove its request for that 
semaphore by writing a one intoits location. The TMS320canthen perform 
another task and occasionally attemptto gain control ofthe token via the 
set and test sequence. Once the PC has relinquished the token, the 
TMS320 can succeed in gaining control of the shared resource. 

The semaphore flags are active low. A token is requested by writing 
a zero into a semaphore location, and is released when the same 
processor writes a one into that location. 

The eight semaphore flags reside within the IDT71342inaseperate 
memory space from the dual-port RAM. This address space is accessed 
by placing a low input on the SEM pin (whichis used as a chip select for 
the semaphore flags), andusingthe other control pins (Address, OE, and 
R/W) as they would be usedin accessing astandard static SRAM. Each 
of the flags has a unique address which can be accessed by either side 
through address pins Ao - A2. When accessing the semaphores, none of 
the other address pins has any effect. 

When writing toasemaphore, only data pin DO is used. Ifa low level 
iswritten into an unused semaphore location, that flag willbe setto azero 
onthat side anda one on the other (see Table |). That location can now 
only be modified by the side showing the zero. When a oneis written into 
the same location from the same side, the flag willbe setto aone for both 
sides (unless a semaphore request from the other side is pending) and 
then canbe written to by both sides. The fact thatthe side whichis able to 
write azerointo asemaphore subsequently locks out writes fromthe other 
side is whatmakes semaphore flags useful in interprocessor communica- 
tions. Azerowritten into the same location fromthe other side willbe stored 
inthe semaphore request latch for that side until the semaphore is freed 
by the first side. 

When asemaphore flagis read, its value is spread into all databits, so 
thata “set” flag reads as a one in all data bits anda flag containing azero 
reads as allzeros. The read value is latched into one side’s output register 
whenthatside’s semaphore select (SEM) and output enable (OE) signals 
go active. This serves to disallow the semaphore from changing state in 
the middle ofaread cycle dueto awrite cycle from the other side. Because 
ofthis latch, arepeated read ofasemaphore inatestloop mustcause either 
signal (SEM or OE) to go inactive, or the output will never change. This 
isnotaconcernin the TMS320C30 Software Development Board, since 
either bus’ accesses to other memory locations between semaphore 
accesses inactivate both of these signals for a relatively long period no 
matter how tight of aloop is used to interrogate the device. 
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Table 1. Example Semaphore Procurement Sequence 


Asequence of WRITE/READ mustbe used to acquire a semaphore 
inorderto guarantee thatno system level contention willoccur. A processor 
requests access to shared resources by attempting to write azerointoa 
semaphore location. Ifthe semaphore is already in use, the semaphore 
requestlatch will contain a zero, yetthe semaphore flag willappear as a 
one, a fact which the processor will verify by the subsequent read (see 
Table |). Asan example, assume the PC writes a zero to the left portata 
free semaphore location. On asubsequent read, the PC will verify that 
ithas written successfully to that location and willassume control over the 
resource in question. Meanwhile, ifthe TMS320 attempts to write a zero 
tothe same semaphore flag, it will fail, as will be verified by the fact thatit 
will read aone from that semaphore during a subsequent read cycle. Had 
a sequence of READ/WRITE been used instead, contention problems 
could have occurred during the gap between the read and write cycles. 

Itisimportantto note that a failed semaphore request mustbe followed 
either by repeated reads, or by writing a one into the same location to 
remove the semaphore request. The reason for this is easily understood 
by looking atthe simple logic diagram of asemaphore flag shown in Figure 
6. Twosemaphore requestlatches feedinto asemaphore flag. Whichever 
latch is the firstto presenta zero to the semaphore flag will force its side of 
the semaphore flag low, andthe other side high. This condition will continue 
until a one is written into the same semaphore requestlatch. Should the 
other side’s semaphore request latch have been written to a zero in the 
meantime, the semaphore flag will flip over to the otherside as soonasa 
one is written into the first side’s requestlatch. The second side's flag will 


now stay low until its semaphore request latch is written with a one. From 
this it is easy to understand that, if a semaphore is requested and the 
processor which requested it no longer needs the resource, the entire 
system could hang up until a one is written into that semaphore request 
latch. 

The critical case of semaphore timing is when both sides request a 
single token by attempting to write a zero into it at the same time. The 
semaphore logicis specially designed to resolve this problem. If simulta- 
neous requests are made, the logic guarantees that only oneside receives 
the token. Ifonesideis earlier than the other in making the request, the first 
side to make the request will receive the token. If both requests happen 
atthe sametime, the assignmentwillbe arbitrarily made to one side or the 
other. 

One caution that should be noted when using semaphores is that 
semaphores alone do not guarantee that access to a resource is secure. 
Aswith any powerful programming technique, ifsemaphores are misused 
or misinterpreted a software error can easily happen. Code integrity is of 
the utmostimportance when semaphores are used instead of hardware 
handshaking. 

Initialization ofthe semaphores is not automatic and mustbe handled 
via the initialization program at power-up. Since any semaphore whichis 
writtento azeromustbe resettoaone, both the TMS320 andthe PC must 
write a one into allsemaphore locations atinitialization to assure that the 
semaphores will be free when needed. 


Dual-Port RAM Simplifies PC-to-TMS320 Interface 


L Port 
Semaphore 
RequestLatch 
Do 
Write 


Semaphore —« 


Application Note AN-68 


R Port 
Semaphore 
RequestLatch 
Do 
Write 


Read 


Semaphore Latch 


B Semaphore 
Read 


2694 drw 06 


Figure 6. IDT71342 Semaphore Logic 


Using Semaphores -Some Examples 

Perhaps the simplest application of semaphores is their application as 
resource markers for the IDT71342’s dual-port SRAM. Say the 4K x 8 
SRAM was tobe divided into two 2K x8 blocks, which were tobe dedicated 
at any one time to servicing either the PC or the TMS320. Semaphore 
0 could be used to indicate the side which would control 
the lower section of memory, and Semaphore 1 couldbe defined as the 
indicator forthe upper section of memory. 

Totake aresoutce, inthis example the lower 2K of dual-port RAM, the 
PC could write then read a zero into Semaphore 0. If this task was 
successfully completed (azero was read back, rather thanaone), the PC 
would assume control of the lower 2K. Meanwhile, the TMS320 might 
attemptto perform the same function. Since the TMS320 was attempting 
to gain control of the resource after the PC, it would read back a one in 
responseto the zeroithad attemptedtowriteinto Semaphore 0. Atthis point, 
the TMS320’s software could choose totry and gain control ofthe second 
2K section by writing, then reading azerointo Semaphore 1. Ifitsucceeded 
in gaining control, it would lock out the PC. 

Oncethe PC was finished withits task, itwould writeaoneto Semaphore 
0, then may try to gain access to Semaphore 1. If Semaphore 1 was still 
occupied by the TMS320, the PC could remove its semaphore request 
and perform other tasks until it was able to write, then read 
a zero into Semaphore 1. If the TMS320 performs a similar task with 
Semaphore 0, this protocol would allow the two processors to swap 2K 
blocks of dual-port RAM with each other. 

The blocks do not have to be any particular size and could even be 
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of variable length, depending upon the complexity of the software using 
the semaphore flags. All eight semaphores could be used to divide the dual- 
port RAM or other shared resources into eight parts. 

Semaphores are a useful form of arbitration in real-time DSP applica- 
tions, when the PC must be locked out of a section of memory during a 
transfer, andthe TMS320 cannottolerate any wait states. With the use of 
semaphores, once the two processors had determined which memory 
area was “offlimits” tothe PC, both the PC andthe TMS320 could access 
their assigned portions of memory continuously without any wait states. 
Both processors can access their assigned RAMsegments atfull speed. 

Another application of semaphores is in the area of complex data 
structures. In this case, block arbitration is very important to the main- 
tenence of data integrity. For this application one processor may be 
responsible for building and updating a data structure, which the other 
processor then reads andinterprets. Ifthe interpreting processor reads 
anincomplete data structure, a major error condition may exist. Therefore, 
some sort of arbitration mustbe used between the TMS320 andthe PC. 
Software semaphores are aperfectfit. The building processor uses the 
semaphore to arbitrate for the block and to lock it once that processors 
able to acquire the semaphore flag. This processor then is able to go in 
and update the data structure. When the update is completed, the 
semaphore and the corresponding data structure block are released. The 
interpreting processor then acquires the semaphore which allows it to 
come back and read the complete data structure, thereby guaranteeing 
consistency. 
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